Picture for Yutong He

Yutong He

Beijing National Research Center for Information Science and Technology

DFSAttn: Dynamic Fine-grained Sparse Attention for Efficient Video Generation

Add code
May 22, 2026
Viaarxiv icon

RoPeSLR: 3D RoPE-driven Sparse-LowRank Attention for Efficient Diffusion Transformers

Add code
May 20, 2026
Viaarxiv icon

FedSLoP: Memory-Efficient Federated Learning with Low-Rank Gradient Projection

Add code
Apr 27, 2026
Viaarxiv icon

Diamond Maps: Efficient Reward Alignment via Stochastic Flow Maps

Add code
Feb 05, 2026
Viaarxiv icon

Mixture-of-Channels: Exploiting Sparse FFNs for Efficient LLMs Pre-Training and Inference

Add code
Nov 12, 2025
Viaarxiv icon

An All-Reduce Compatible Top-K Compressor for Communication-Efficient Distributed Learning

Add code
Oct 30, 2025
Viaarxiv icon

Efficient Long-Context LLM Inference via KV Cache Clustering

Add code
Jun 13, 2025
Figure 1 for Efficient Long-Context LLM Inference via KV Cache Clustering
Figure 2 for Efficient Long-Context LLM Inference via KV Cache Clustering
Figure 3 for Efficient Long-Context LLM Inference via KV Cache Clustering
Figure 4 for Efficient Long-Context LLM Inference via KV Cache Clustering
Viaarxiv icon

Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation

Add code
Jun 09, 2025
Viaarxiv icon

Benchmark of Segmentation Techniques for Pelvic Fracture in CT and X-ray: Summary of the PENGWIN 2024 Challenge

Add code
Apr 03, 2025
Viaarxiv icon

CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models

Add code
Feb 03, 2025
Figure 1 for CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models
Figure 2 for CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models
Figure 3 for CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models
Figure 4 for CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models
Viaarxiv icon